Baby Gym For Robots: A New Platform For Testing Developmental Learning Algorithms
نویسنده
چکیده
This extended abstract describes a new platform for robotic manipulation research that was inspired by some of the first toys that human infants learn to manipulate. It summarizes the results of our existing research on pressing buttons and formulates some ideas for future work. Motivation and Inspiration Figure 1: My son in his Baby Gym. When my son was growing up I noticed that it took him almost 5 months to learn how to press a button reliably on one of his toys. The toy had a number of buttons, sliders, levers, and knobs that could be manipulated (see Figure 1). These toys come in many varieties and in recent years some toy manufacturers have started to call them Baby Gyms. Initially my son was more interested in exploring the plastic casing of the button instead of the button itself. He had to learn what the button looks like and how and where to press it. This learning process started when he was 4.5 months old and continued until he was 9 months old. Once he had mastered this skill, however, he knew how to detect other buttons around the house. Furthermore, he knew that these devices are for pushing and he knew how to push them. Light switches, keyboard keys and microwave buttons were now intuitively obvious to him as they were in the same equivalence class with that first button. Similar developmental sequences were observed for a number of other devices present in the baby gym. Copyright © 2011, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved. Learning to Press Buttons The devices found in baby gyms – buttons, sliders, levers, wheels, and knobs – are widely used in human environments. Buttons alone are present in virtually every gadget that humans have ever created. Robots operating in human environments would have to know not only how to navigate without hitting any obstacles, but also how to operate these devices by actively touching and manipulating them. Otherwise these robots would not be very useful. These manipulation tasks are still challenging for robots yet one-year-old infants can perform them quite easily. This suggests that the exploration methods that infants use may hold the key to solving such tasks with robots. These methods rely on exploratory behaviors such as pushing, scratching, and nudging, which can be quite powerful when combined with multimodal perceptual change-detection functions. One key advantage of the baby gym platform is that it offers a nice way to have multiple reproducible experiments. These observations motivated a pilot study to test if a robot can learn to press buttons in a similar way (Sukhoy et al. 2010; Sukhoy and Stoytchev 2010). That is, to test if a robot can learn both where and how to press buttons from its own experience without prior knowledge of what buttons look like. We built an experimental fixture similar to a baby gym (see Figure 2) that contained multiple doorbell buttons. The results showed that the robot was indeed able to learn to press buttons autonomously. Furthermore, the robot simultaneously learned a visual model for what a button looks like and was able to use it to detect novel buttons. Figure 2: The experimental fixture with doorbell buttons inspired by my son’s baby gym. Figure 3: Mapping between the baby gym toys and other similar devices around my house.
منابع مشابه
Cooperative Control of Mobile Robots in Creating a Runway Platform for Quadrotor Landing
Multi-agent systems are systems in which several agents accomplish a mission in a cooperative manner. In this paper, a novel idea for the construction of a movable runway platform based on multi-agent systems is presented. It is assumed that an aerial agent (quadrotor) decides to make an emergency landing due to reasons such as a decrease in energy level or technical failure, while there is no ...
متن کاملOptimal Trajectory Planning of a Mobile Robot with Spatial Manipulator For Spatial Obstacle Avoidance
Mobile robots that consist of a mobile platform with one or many manipulators mounted on it are of great interest in a number of applications. Combination of platform and manipulator causes robot operates in extended work space. The analysis of these systems includes kinematics redundancy that makes more complicated problem. However, it gives more feasibility to robotic systems because of the e...
متن کاملTitle: Modeling Behavior Cycles as a Value System for Developmental Robots Running Title: Behavior Cycles in Robots
The behavior of natural systems is governed by rhythmic behavior cycles at the biological, cognitive and social levels. These cycles permit natural organisms to adapt their behavior to their environment for survival, behavioral efficiency or evolutionary advantage. This paper proposes a model of behavior cycles as the basis for motivated reinforcement learning in developmental robots. Motivated...
متن کاملSensor-rich robots driven by real-time brain circuit algorithms
The analysis of particular telencephalic systems has led to derivation of algorithmic statements of their operation, which have grown to include communicating systems from sensory to motor and back. Like the brain circuits from which they are derived, these algorithms (e.g. Granger, 2006 ) perform and learn from experience. Their perception and action capabilities are often initially tested in ...
متن کاملForward kinematic analysis of planar parallel robots using a neural network-based approach optimized by machine learning
The forward kinematic problem of parallel robots is always considered as a challenge in the field of parallel robots due to the obtained nonlinear system of equations. In this paper, the forward kinematic problem of planar parallel robots in their workspace is investigated using a neural network based approach. In order to increase the accuracy of this method, the workspace of the parallel robo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011